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Abstract — In this paper, we investigate the problem of 
assignment of K identical servers to a set of N parallel queues 
in a time slotted queueing system. The connectivity of each 
queue to each server is randomly changing with time; each 
server can serve at most one queue and each queue can be 
served by at most one server per time slot. Such queueing 
systems were widely applied in modeling the scheduling (or 
resource allocation) problem in wireless networks. It has 
been previously proven that Maximum Weighted Matching 
(MWM) is a throughput optimal server assignment policy for 
such queueing systems |1|, |2|. In this paper, we prove that 
for a symmetric system with i.i.d. Bernoulli packet arrivals 
and connectivities, MWM minimizes, in stochastic ordering 
sense, a broad range of cost functions of the queue lengths 
including total queue occupancy (or equivalently average 
queueing delay). 

I. Introduction 

Optimal stochastic control of emerging wireless net- 
works is one of the primary objectives in the design of such 
networks. In general, the main goal in the stochastic control 
of wireless networks is to distribute the shared resources in 
physical (e.g. power) and MAC layers (e.g. radio interfaces, 
relay stations and orthogonal channels) to multiple users 
such that a certain stochastic performance attribute is 
optimized. While various performance attributes including 
the stable throughput region, power consumption and utility 
functions of the admitted rates have been studied in many 
papers, average queueing delay has been considered far 
less in literature. This is due to the inherent difficulty of 
delay optimal scheduling problems in queueing systems 
with time varying channel conditions. In this paper, we 
consider a discrete time queueing system which is suitable 
in modeling of orthogonal resource assignment (e.g. radio 
interfaces/channel allocation) in multi-user wireless access 
networks. In our system, we model the available shared 
resources by a set of identical servers. The model also con- 
sists of a set of queues whose connectivities to each server 
is changing by time randomly. Therefore, the resource 
assignment problem is equivalent to finding a matching 
between the queues and the servers at each time slot such 
that some performance objectives are optimized. It has been 
already shown that Maximum Weighed Matching (MWM) 
is throughput optimal for such a system, i.e., it maximizes 



the stable throughput region of the system ifD, |2|. MWM 
has also been extensively used in literature for treating 
the scheduling problem in crossbar packet switches Q- 
(6j- In this paper, we prove that for a symmetric system 
with i.i.d. Bernoulli arrivals and connectivities (i.e. with 
the same arrival and connectivity parameters for all the 
queues), MWM is also optimal in minimizing, in stochastic 
ordering sense, a broad range of cost functions of queue 
lengths including total queue occupancy (or equivalently 
average queueing delays In other words, we show that 
MWM policy minimizes stochastically a broad range of 
cost functions of queue length processes including the 
expected total queue occupancy across all possible server 
assignment policies. 

The problem of optimal server allocation in queueing 
systems with random connectivities was mainly addressed 
in fl], 0, GJ-Ol- In ID. the authors introduced the 
notion of stability region of a general queueing network 
with time varying connectivities and they proposed back- 
pressure algorithm as a throughput optimal resource alloca- 
tion policy for queueing networks. In [7|, they considered 
a multi-queue single-server queueing system with random 
connectivities. They characterized the stability region by 
a set of linear inequalities and also proved that for a 
symmetric system with the same arrival and connectivity 
parameters for all the queues, LCQ (Longest Connected 
Queue) provides the optimal performance in terms of 
average queue occupancy. 

In ifTTI . Maximum Weight (MW) policy was proposed 
as a throughput optimal server allocation policy for multi- 
queue multi-server queueing systems with stationary chan- 
nel processes. In lfl"3l . the authors characterized the net- 
work capacity region of multi-queue multi-server queue- 
ing systems with time varying connectivities. They also 
obtained an upper bound for the average queueing delay 
of AS/LCQ policy which is a throughput optimal server 
allocation policy for these systems. The results were fur- 

'We order two discrete time random processes A = {A(t)}'^_ 1 and 
B = {B(t)}'^L 1 stochastically as follows: We say A is stochastically 
less than B and we write A < 3t B if Pr(A(t) > r) < Pr(B(t) > r) 
for all t = 1,2, ... and all r£l. The notion and relevant properties will 
be discussed in more detail in Section ITlI-B I 



ther extended in [14| for more general stationary channel 
distributions (and not just i.i.d. Bernoulli channels). 

The authors in [8| considered a queueing model with 
a set of symmetrical parallel queues competing for K 
identical servers. The connectivity of each queue to all 
the servers is assumed to be the same at each time slot 
and during each time slot, each queue can attract at most 
one server. The authors proposed LCQ policy in which the 
servers are allocated to the K longest connected queues at 
each time slot. They proved the optimality of LCQ policy 
by using dynamic coupling and stochastic ordering method. 

The work in (9), iflOl . Ifl2l . [15| focuses on the opti- 
mal server allocation problem in multi-queue multi-server 
queueing systems in terms of average queueing delay. In 
ED, ED, OS, the authors introduced MTLB (Maximum- 
Throughput Load-Balancing) policy and showed that this 
policy minimizes a class of cost functions including total 
average delay for the case of two symmetric queues. The 
work in lfl2ll considers this problem for general number 
of symmetric queues and servers. In lfl2l . a class of Most 
Balancing (MB) policies was characterized among all work 
conserving policies which are minimizing, in stochastic 
ordering sense, a class of cost functions including total 
average delay. Note that in the model used in J9), IflOl . 
Ifl2l . ff3l . fl5l . there is no restriction on the number of 
servers that are serving a queue at each time slot. In Q, 
it was shown that for a multi-queue multi-server system in 
which queues are restricted to attract at most one server 
at each time slot, Maximum Weighted Matching (MWM) 
policy is throughput optimal. The authors also considered 
the effect of infrequent channel state measurements on the 
stability region. 

The rest of the paper is organized as follows. Section ITTI 
describes the model and the notation required throughout 
the paper. In section [TTTJ we introduce Maximum Weighted 
Matching (MWM) policy as the optimal policy for the 
described model. We will also review the concepts of 
stochastic ordering and dynamic coupling method which 
are the main mathematical tools used in proving the op- 
timality of MWM policy. In section [IV] we present the 
main result of this paper, that is proving the optimality of 
MWM server assignment policy. Section W\ summarizes the 
conclusions of the paper. 

II. Model Description 

We consider a time slotted parallel queueing system with 
a set of parallel symmetrical queues TV = {1,2,..., N} 
and infinite buffer space for each queue. Packets in this 
system are assumed to have constant length and require 
one time slot to complete service. The service to this set of 
queues is provided through a set of identical servers namely 
JC = {1, 2, K}. The connectivity of each queue n E TV 
to each server k E JC at each time slot t is random and 
follows a Bernoulli distribution. We denote the connectivity 
of queue n to server k at time slot t by C n ^(t). Note that 
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Fig. 1: Discrete time queueing system with TV parallel 
queues and K servers 



C n<k (t) G {0, 1} and E[C n<k (t)] = p for all n E TV and 
k E JC and t = 1,2,.... 

At each time slot, each server can serve at most one 
packet from a connected non-empty queue. Note that in 
the system we do not have server sharing i.e., a server can 
serve at most one queue at each time slot. We also assume 
that a queue which is being serviced by a server at a given 
time slot, cannot get service from other servers during the 
same time slot. 

Let A n (t) be the packet arrival process (number of 
packet arrivals) to queue n at time slot t. We assume that 
new arrivals at each time slot are added to the queues at 
the end of the time slot. Assume that the arrival processes 
A n (i) at each time slot t are independent Bernoulli random 
variables with the same parameter for all n and t. We 
denote the length of queue n at the end of time slot t (i.e., 
after adding the new arrivals) by X n (t). In other words, 
X n (t) represents the number of packets in the nth queue 
at the end of time slot t (or beginning of time slot t + 1). 

A server assignment policy at each time slot determines 
an assignment of servers of set JC to the queues of set TV. In 
other words, at each time slot the scheduler has to decide 
about a bipartite matching (matching in bipartite graphs) 
between sets TV and JC. This should be accomplished 
based on the available information about the connectivities 
C n ,k(t) an d a ls° the queue length process at the beginning 
of time slot t (which is X(t - 1) = (X x {t - l),X 2 (t - 
1), Xw(t — 1))). For a given policy tt, suppose that 
indicator variable I^l(t) is defined to be "1" if server 
k is assigned to queue n at time slot t and "0" otherwise. 
We define M^\t) = {/£2(*),Vn E TV, k E JC} as the 
employed matching by policy it at time slot t. Therefore, a 
server scheduling policy n is defined as n = 

According to the above discussion, we can see that the 
queue length random variable X n (t), Vn E TV evolves with 
time according to the following rule: 

x n (t) = (^x n (t - 1) - c^kWO*)) + A " W 



where (•)+ returns the term inside the brackets if it is 
non-negative and zero otherwise. Note that a server can 
be assigned to an empty queue however it cannot serve it 
since there is no packet to be served. That is why we have 
used operator (-)+ in ([]])■ 

As we discussed earlier, the queueing model introduced 
in this section is useful in modeling the resource assign- 
ment problem in various systems with shared resources. In 
wireless communication systems, communication resources 
such as communication sub-channels, relay stations, etc. 
are shared among users and therefore can be studied 
using our model (e.g. Q, [16]). Bipartite Matching also 
has been extensively used in literature (e.g. |3]-||6]) to 
model the scheduling problem in crossbar packet switching 
systems. In this paper, random variables are represented 
by CAPITAL letters and lower case letters are used to 
represent sample values of the random variables. 

III. Background 

A. Maximum Weighted Matching 

In 0], dH, El-imi, it was shown that Back-pressure 
algorithm maximizes the stable throughput region of a 
general data network. For the model introduced in section 
ITU Back-pressure algorithm is equivalent to solving the 
following optimization problem at each time slot t ||2). 

JV K 
Maximize ^ x n{t ~ 1 ) In,k{t)c n ,k{t) 



n=l 
K 



k=l 



s.t. J2 In > k( V ^ 1 (n=l,2..,N) 

fc=i 

N 

5^W*)<1 (fc = 1,2.., K) (1) 



where x n (t — 1) and c nj fc(i) are the values of random 
variables X n (t — 1) and C n ^k{t) at time slots t — 1 and 
t, respectively. Note that finding the solutions of problem 
(Q~|) is equivalent to finding a maximum weighted matching 
in the bipartite graph G t = (Af, /C, £ ) (see Figure [2]). 
In Gt, Af and /C are the two sets of vertices in each 
part of the graph and £ — Vn G Af,Vk e K.} is 

the set of edges between these two parts. Note that the 
associated weight to each edge e n> k is x n (t — l)cn k(t). 
A matching in graph Gt is basically a sub-graph of Gt 
in which no two edges share a common vertex. Note that 
any matching M'*'(i) at any time slot t is corresponding 
to a sub-graph of G t namely = (Af,fC,S^) in 

which e nt k £ £^ if and only if I^\(t) = 1. Suppose 

that M( MWM )(i) = {4 M , WM V),Vn 'g Af,k € £} be 
the matching whose indicator variables are the solution of 
the optimization problem (Q]). Thus, we define Maximum 
Weighted Matching (MWM) server assignment policy as 
MWM = {M^w^i)}^. There are several algorithms 
to find the maximum weighted matching in bipartite graphs. 
The most well known algorithm is Hungarian algorithm 
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Fig. 2: Bipartite graph corresponding to problem (Q]i 



whose complexity is of 0((min{iV, K})(max{N, K}) 2 ) 

ED. ' 

As explained before, MWM is known to be throughput 
optimal for the queueing system described in section ITT1 12 1. 
Our contribution in this paper is to prove that MWM is also 
optimal in minimizing, in stochastic ordering sense, a class 
of cost functions of the queue length processes including 
the total system occupancy (or equivalently total average 
queueing delay) for the symmetric queueing system of 
Figure Q] (which can be used to model a homogeneous 
wireless access network). We will introduce a detailed 
description of those class of cost functions in the following 
section. 

B. Stochastic Ordering and Dynamic Coupling 

In this section, we briefly review the concepts of stochas- 
tic ordering (stochastic dominance) and dynamic coupling 
techniques. Consider two discrete time stochastic processes 
A = {A{t)}%L x and B = {B{t)}%L x in R. We say A 
is stochastically less than B and we write A < st B if 
Pr(A(t) > r) < Pr(B(t) > r) for all t = 1,2,... and all 
r G K IF2TI . 11221 . Some properties of stochastic ordering 
are the following. If A < st B then f(A) < st f{B) 
for all non-decreasing functions /. If A < st B then 
E[A(t)] < E[B(t)]. A is stochastically smaller than B 
(A < st B), if there exists process A = {A^)}^ 
defined on the same probability space as B with the same 
probability distribution as A and satisfy A(t) < Bit) 
almost surely for every t = 1, 2, ... [8|. The last statement 
is known as coupling of A and A. In fact, when applying 
coupling technique, we are given the process A and we try 
to construct a coupled process A with the same distribution 
as A and A(t) < B(t) a.s. for all t. This gives us 
a tool for comparing processes A and B stochastically. 
This is specially useful when it is infeasible to derive the 
distributions of A and B (e.g. in our queueing model when 
comparing the total occupancy process for different server 
assignment policies). 

IV. Optimality of MWM 

In this section, we present the main result of this paper, 
that is proving the optimality of MWM with respect to 
minimization of a class of cost functions of queue lengths 



including the average queueing delay. Suppose that Z + 
be the set of non-negative integers and be the N 
dimensional Cartesian space of non-negative integers. We 
define relation " < " over as follows. 

Definition 1: For two vectors x , x E iff, we write 
x r< x if one of the following relations holds: 

Dl: x n < x n for all n = 1, 2, iV 
D2: x is obtained by permutation of two distinct elements 
of x, i.e., x and X are different in only two elements n 
and m such that x n = x m and x m = x n . 
D3: 5 and x are different in only two elements n and 
m such that a; n < x n < x m < x m and the following 
constraints are satisfied: x n = x n + 1 and :r TO = a; m — 1. 
In D3, we say that x is more balanced than x and can be 
obtained by decreasing a larger element of x (between m 
and n) by "1" and increasing a smaller element (between 
m and 77) by "1". We call such an interchange a balancing 
interchange on vector x. Thus, the result of a balancing 
interchange on a vector x would be a vector a; such that x < 
x. Suppose that vector x E represents the queue length 
vector at a given time slot. Then, a balancing interchange 
is equivalent to taking a packet from a larger queue and 
adding it to a smaller queue. 

We define the partial order " < p " on lff_ as the 
transitive closure of relation " ^< " ll23l . In other words, 
x ^< p x if and only if x is obtained from x by performing 
a sequence of reductions, permutations of two elements 
and/or balancing interchanges. When x and x are two 
queue length vectors, we write x < p x if and only if queue 
length vector x is obtained from x by applying a series of 
packet removal, two queues permutations and balancing 
interchanges. 

We define T as the class of real-valued functions on lff_ 
that are monotone and non-decreasing with respect to the 
partial order " < p ", i.e., 



X,(r-1)=3 



X <p x => f(x) < f(x). 



(2) 



N 



We can easily check that function f(x) = 2^2 
belongs to T. This function captures the total queue 
occupancy of the system. 

LetX'(t) = (X[(t), X^(t), X' N (t)) denote the queue 
length vector at time slot t exactly after serving the queues 
according to a server assignment policy tt and before 
adding the new arrivals of time slot t, i.e., 



(^(t-^)-^C n , k (t)lQ(t)\ . (3) 

Given x'(t) as a sample value of random variable X'(t), 
we define a balancing server reallocation at time slot t as 
follows: 

Definition 2: A balancing server reallocation on vector 
x'{t) is a matching that results in vector x'(t) such that 
one of the following conditions is satisfied. 
(CI): x' n (t) < x' n (t) for all n = 1,2,..., N and there 
exists m E {1,2, ...,N} such that x' m (t) < x' m (t). 



X 2 (t-l)=2 x 



X,(r-1)=3 



X 2 (t-1)=2 X 



X,(t-l)=5 A \ ^ X 3 (t-1)=5 A'\ 



(a) Satisfying condition CI (b) Satisfying condition C2 

Fig. 3: Examples of balancing server reallocations 



(C2): x'it) and x'(t) are different in only two elements 
77 and m such that x' n (t) < x' n (t) < x' m (t) < x' m (t) and 
the following constraints are satisfied: x' n (t) = x' n (t) + 1 
and x'Jt) = x' m (t) - 1. 

Figures [3a] and [3b] show two examples of balancing server 
reallocations in two sample graphs. In these figures, the 
original allocations are specified by solid lines while the 
balancing reallocations are specified by dashed lines. 

Consider an arbitrary server assignment policy it with the 
allocation var iables {I^lit)}^ for all fc E K and n £ Af. 
We introduce Matching Weight (MW) index associated to 
a server allocation policy tt at time slot t by 



N 



K 



Mw,(o = J2 *»(*- i)E^(*) 7 S (*) 



(4) 



fc=l 



Note that MW index is exactly the objective of the op- 
timization problem (|T). According to Definition [2] and 
definition of MW index, we can prove the following 
Lemma. 

Lemma 1: For a given policy it employing matching 
M^ n \t) at time slot t, by applying a balancing server 
reallocation at time slot t (if there exists any) we will have 
a new policy n differing from tt only at time slot t such 
that MW T (t) < MWjf(i). 

The proof is omitted here due to space limitations. The 
detailed proof of the lemma is given in [24|. Based on 
Lemma Q] we can state the following corollary. 

Corollary 1: For a given policy tt at time slot t, if 
MW^Tj) is maximized, i.e., policy tt employs a maximum 
weighted matching at time slot t, then there exists no 
balancing server reallocation at that time slot. 

Note that Lemma [T] just states that any balancing real- 
location increases the matching weight index. However, it 
does not imply the existence of a balancing server reallo- 
cation when M\N n (t) is not maximized. In the following, 
we will prove the reverse of Lemma [T] 

Lemma 2: For a given policy n at time slot t, if MW^i) 
is not maximized, i.e., MW 7r (t) < MWmwm(£), then there 
exists a balancing server reallocation at that time slot. 

The proof is lengthy and is omitted here due to space 
limitations. For the detailed proof, please refer to ll24l . 

By n MWM , we denote the set of all policies who employ 
maximum weighted matching at all time slots. We also 



define II t as the set of all policies that employ maximum 
weighted matching exactly until time slot t (including t). 
We can easily observe that II t C II t _ 1 and n MWM = 
nS=i n*- From Lemmas [T] and |2 we conclude that given a 
policy tt G n t _i which is using an arbitrary matching at 
time slot t, we can reach to a policy tt* G n t by applying 
a sequence of balancing server reallocations. Suppose that 
represents the number of balancing server reallocations 
required to convert the employed matching in policy tt at 
time slot t to a maximum weighted matching. In this case, 
we say that the distance of tt from II t is balancing 
server reallocations. Note that if the distance of tt from n t 
is hf, after applying the first balancing server reallocation, 
we get to a policy tt whose distance from II t is — 1 
balancing server reallocations. By repeating this procedure 
we finally get to a policy whose distance to n t is zero, i.e., 
it belongs to IT t . By ITj 1 (0 < h < h*) we denote the set 
of all server assignment policies in LT t _i whose distance 
from H t is at most h balancing sever reallocations. Note 

that n° = n t . 

Consider any two policies tt and tt such that f(X) < s t 
f(X), f G T where X = {X(t)}? =1 and X = {X(t)} t -j 
are the queue length processes when policies tt and tt are 
applied respectively. For such a system, we say policy 
tt dominates tt. Therefore, if tt dominates tt we have 
E[f(X)] < E[f(X)}. Given f{x) = J2n=i^, we 
conclude that the average queue occupancy (or equivalently 
average queueing delay) of policy tt is smaller than that of 
policy tt. According to the above discussion, we can prove 
the following Lemma. 

Lemma 3: For any policy tt G IlJ 1 and < h < we 
can construct a policy tt G LTj 1 such that tt dominates tt. 

Here, we just give the outline of the proof. For the 
detailed proof please refer to 11241 . The proof follows 
by applying dynamic coupling method over random 
variables C(t) = (C„ )fc (i)), Vn G Af,Vk G K and 
A(t) = (Ai(t),A 2 (t),...,A N (t)). In other words, we 
will show that given an arbitrary sample path u = 
(a;(0), c(l), o(l), x(l), c(2), o(2), x(2), c(3), o(3), z(3)...) 
we can construct policy tt and a new sample path ui = 
(5(0), 5(1), 2(1), 5(1), 5(2), 2(2), 5(2), 5(3), 5(3), 5(3), ...) 
resulting in a new sequence of random variables 
(X(0), (7(1), 1(1), 1(1), (7(2), 1(2), X(2), (7(3),...) with 
X(0) = X(0) such that 5(t) < p x(t) for all t. In fact, we 
construct w and tt G IIj 1 in such a fashion that /or a// 
the sample paths and all time slots we have x(t) < p x(t). 
The construction of tt is consisting of two main steps: 
construction for time slots before and including t and 
construction for time slots after t. The construction before 
and including t follows by using the matchings of policy 
tt for time slots before t. For time slot t, we apply the 
balancing server reallocation. The construction after t 
follows by using mathematical induction. The detailed 
proof is lengthy and is omitted at this point. We refer the 
interested readers to [24] for more detail. 

Based on Lemma|3] we can prove the main result of this 



paper in the following Theorem. 

Theorem 1: Maximum Weighted Matching policy dom- 
inates any server assignment policy. 

Proof: Let ttq be any arbitrary policy. Then ttq G Ilo = 
II^ 1 where Hi — h\ a . By applying Lemma [3] repeatedly, 
we can construct a sequence of policies such that each 
policy dominates the previous one. Thus, we obtain policies 
that belong to n = nf 1 , nf nf 1 ~ 2 , IT? = n x . 
The last policy is called tt\. Note that tt\ G II^ 2 where 
i?2 = h^ 1 ■ By recursively continuing such argument we 
obtain a sequence of policies n t G II f , t ~ 1,2,... such 
that TTj dominates TTi for j > i. Note that this sequence 
of policies defines a limiting policy tt* that agrees with 
MWM at all time slots. Thus, tt* is an MWM policy who 
dominates all the previous policies, including the starting 
policy 7r . ■ 

V. Conclusions 

In this paper, we considered the problem of assignment 
of K identical servers to a set of N parallel queues in 
a symmetrical time slotted queueing system with random 
connectivities from the queues to the servers. For such a 
queueing system, it has been previously shown that MWM 
is throughput optimal, i.e. has the maximum stability re- 
gion. Our contribution in this work is the development of a 
method to prove the optimality of MWM in minimizing, in 
stochastic ordering sense, a class of cost functions of queue 
lengths (including total queue occupancy or equivalently 
average queueing delay). Our method to achieve this goal 
used stochastic ordering and dynamic coupling techniques. 
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